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SUMMARY 

Interesting Work is belli" done in programing computers to 
solve problems which require a high degree of intelligence in 
humans. However, certain elementary verbal reasoning processes 
so simple that they can be carried out by any non-feeble-minded 
human have yet to be simulated by machine programs. 

‘This paper will discuss programs to manipulate in a suita.- 
blo formal language (most likely a part of the predicate cal¬ 
culus) common Instrumental statements, ©is basic program will 
draw immediate conclusions from a list of promises. ©lose con¬ 
clusions will be either declarative or imperative sentences. 
When an imperative sentence is deduced the program- takes a cor¬ 
responding a.ction« These actions may include printing sen¬ 
tences, moving sentences on list3, and reinitiating the basic 
deduction process on these lists. 

Facilities will be provided for communication with humans 
in the system via manual intervention and display devices con¬ 
nected to the computer 


The "advice taker" is a proposed program for solving 
problems by manipulating sentences in formal languages. The 
main difference between it and other programs or prep osed 
programs for manipulating formal languages (the "Logic Theory 
Machine n of Howell, Simon end Shaw and the Geometry Ir-ogram 
of Gelernter) is that in the previous programs the formal sys¬ 
tem was the subject matter but the heuristics were all em¬ 
bodied in the program. In this program the procedures will 
be described as much as possible in the language itself and. 

In particular, the heuristics are all so described. 
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The ^ main advantages wo expect the "advice taker" to have 
is that its behavior will bo improvable merely by making state¬ 
ments to it, telling it about its symbolic environment and 

VIan ' CQ & ^ r om it. To make those statements will require 
1-ttle if* any knowledge of the program or the previous know- 
leage of the ‘advice taker”. One will be able to assume that 

s,r_iv ^ c ? taker will have available to it a fairly wide class 
or immediate logical consequences of anything it is told and its 
previous_knowledge. This property is expected to have much in 
common with wnat makes us describe certain humans as havin'* com¬ 
mon sense. We shall therefore say that: A program has coSmSn 
sense if it automatically deduces for itself a sufficiently 

wide class of immediate consequences of anything it is told and 
what it already knows. 


^The design oi this system will be a joint project wit! 
Marvin Kinsley, but Minsky is not to be held responsible foi 
views expressed here. 
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Before describing the "advice taker" in any detail, 2 would 
like to describe more fully our motivation for proceeding in 
txiis direction. Our ultimate objective is to make programs that 
learn from their experience as effectively as hvrmons do, it may 
not be realised how far we are presently from this objective. 

It is not hard to make machines learn from eznorier-oe to make 
simple changes in their behavior of a kind which has been an¬ 
ticipated by the prog-r. ...mer, or ewaaole, Samuel has included 
An his checker program facilities for improving the weights the 
maexune assigns to various factors in evaluating positions. He 
® , ^ so included a scheme wnereby the machine remembers games 
ft has played previously and deviates from its previous play 
wnen it finds a position which it- previously lost. Suppose, 
however, ^ne.fc we wanted an Improvement in behavior corresnond- 

to the discovery by the machine of the principle of 
the opposition an checkers • Ivo present or presently proposed 
schemes are capable of discovering phenomena as abstract*as this. 

If one wants a machine to be able to discover an abstraction, 
it seems most likely that the machine must be able to represent 
this abstraction in some relatively simple way. 

There is one known way of making a machine capable of 


late aroicrazy fcenaviora and try them out. These behaviors may 
be represented either by nerve nets (ref. 2), by Turing ma¬ 
chines (ref. 3 ), or by calculator programs (refl 4 ). The - diffi¬ 
culty is two-fold. First, in any of these representations the 


anc - e ^'Gn more important, small interesting changes in behavior 
expressed at a high level of abstraction do not have 3 imle 
representations. It is as though the human genetic structure 
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wore represented by a eat of blue-prints. • Then a mutation would 
usually result in a wart or a failure of ports to meet, or* oven 
an UDgrani~;at5.cal blue-print which could not be translated into 
an animal at ell* It is vary difficult to see hot: the genetic 
representation scheme manages to be general enough to represent 
the great variety of animals observed and yet be such that so 


small gGnaceie changes. The problem of hot*: such a ronresentation 
controls the development of a fertilised egg into a mature ani¬ 
mal is even more difficult. 

In our opinion, a system which i 3 to evolve intelligence of 
human order should have at least the follot 7 ing features: 

1. All behaviors must be representable in the system. 
Therefore, the system should either be able to construct 
arbitrary automata or to program in some general purpose 
programming language. 

2. Interesting changes in behavior must be expressible in 
a simple nay. 

3* ^All aspects of behavior except the most routine must 
be improvable. In particular, the improving mechanism 
should be improvable. 

4. The machine must hai r e or evolve concepts of partial 
success because on difficult problems decisive successes 
or failures come too infrecuontly. 

5. The system must be able to create subroutines which 
can be included in procedures as units. The learning of 
subroutines. is complicated by the fact that the effect of 
a subroutine is not usually good or bad in itself. There¬ 
fore, the mechanism that selects subroutines should have 
concepts of an interesting or powerful subroutine whose 
application may be good under suitable condition::. 
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sipns we shall concentrate entirely on this point and attempt 
to. achieve a system which can be told to mates a specific Ira-’ 
provoment in its behavior with no mors knowledge of its internal 
structure or previous knowledge than is required in order to in- 
struct a human. Once this is achieved, we may be able to tell 


The main distinction between the way one programs a computer 
and modifies.the program and the way one instructs a human or 
will instruct the ‘’advice taker” is this: A machine is instructed 
n^ihly in the form of a sequence of imperative sentences: while a 
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the situation in which action is required together with a few 
imperatives that say what is wanted, tie shall list the ad¬ 
vantages of the two methods of instruction. 


■'■ --i 


Advantages 6f Imperative Sentences 

- -- — * - — ------ — — - ““- 1 — 


1. A procedure described in imperatives is already laid out and 
is carried out faster. 

2. One starts with a iEaoh5.no in a basic state and does not assume 
previous knowledge on the part of the machine• 


V 


Advantages of Declarative Sentences 


Hfc,. » «>■!' 


1. Advantage can be taken of previous.knowledge• 

2. Declarative sentences have logical consequences and it can 
be arranged that the machine will have available sufficiently 
simple logical consequences of what it is told and what it pre¬ 
viously knew. 

3. The meaning of declaratives is much less dependent on their 
order than is the case with imperatives. Tills makes it easier 
to have after-thoughts. 

4. The effect of a declarative is lens dependent on the pre¬ 
vious state of the system so that less knowledge of this state 
is required on the part of the instructor. 



verbally 
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THE CONSTRUCTION OF THE ADVICE TAKER 

The "advice taker" system lias the following main features: 

1. There is a method of representing expressions in the.compu¬ 
ter. These expressions are defined recursively as follows: A 
class of entities called terms'is defined and a tens is an ex¬ 
pression. A sequence of expressions is an expression. These - 
expressions are represented in the machine by list structures (ref.1). 

2. Certain of these expressions may be regarded as declarative, 
sentences in a certain logical system which will be analogous to 1 
a universal Post canonical system. The particular system chosen 
will depend on programming considerations but '.will probably have 
a single rule of inference which will combine substitution for 
variables with modus ponens. The purpose of the combination Is 
to avoid choking the machine with special cases of general pro¬ 
positions already deduced. 
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3* There is an immediate deduction routine which when given a 
set of promises will deduce a set of immediate conclusions • 
Initially, the itrmodiabe deduction routine will singly write 
down all one-step consequences of the promises. Later,, this may 
be elaborated so that the routine will" produce some other con¬ 
clusions which stay he of interest. However, fchi3 routine will 
not use semantic heuristics; i.e. heuristics which depend on the. 
subject matter under discussion. 

A 

The intelligence, if any, of the advice taker will not be 
embodied in the immediate deduction routine. This infcelli-. 
gence will be embodied in the procedures which choose the lists 
of premises to which the immediate deduction routine is to fee 
applied• Of course, the program should never attempt to apply 
the immediate deduction routine simultaneously to the list of 
everything it knows• This would make the deduction routine take 
too long. 

4. Hot all expressions are interpreted by the system as de¬ 
clarative sentences. Some are the names of entities of various 
hinds. Certain formulas represent objects. For our purposes, 
an entity is an object if we have something to say ’about it 
ether than the things which may be deduced from the form of its 
name. j?-or example, to most people, the number 3012 is not an 
object; they have nothing to cry about it except what can be de¬ 
duced from its structure. On the other hwed, to moot Americans 
the number 1JJ6 is an object 1: e cause they have filed somewhere 
the fact that it represents the year w hen the American Revolu¬ 
tion started. In the “advice taker 5 ’ each object has a proper¬ 
ty list in which are listed the specific things we have to say 

* w 'y —^t —* 4? • %* j v/.ew ye k.«;t'Ae* v-t.-,ii 'v.vw.Ia'Sv, 1 . Jl 1 OX 

the object may be included in the property list anyhow if the 
deduction was actually carried out"and was difficult enough so 
that the system does not want to carry it out again. 

5. Entities other than declarative sentences which can be repre¬ 
sented by formulas in the system are individuals, functions, end 
programs. 

6. The program is intended to operate cyclically as follows. 

The immediate deduction routine is r.mmXiGd to a list of vreoises 
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deduces and obeys. 
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to act by means of an example. Assume that I am seated at my 
desk and I wish to go to the airport. Ky car is at cy home also. 
The solution of the problem is to wall: to the car and drive the 


car to the airport. First, wo shall give a formal statement of 
the premises the “advice taker” uses to draw the conclusions. 


Than we shall discuss the heuristics which cause the "advice 
taker 11 to assemble these premises from the totality of facts it 
has available. Tho premises cose in groups, and we shall ex¬ 
plain the interpretation of each group* 

* 

1 . First, we have a predicate "at". "at(x,y) n is a formalisa¬ 
tion pf "x is at y"* Under this heading we have the premises 


2 . 

3 • 

4 . 
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at 

at 

at 


at 
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(X, desk) 

(desk, home) 

(car, home) 

(home, county) 
(airport, county) 


We shall need the fact that the relation "at" is transitive 
which might he written directly as 

6. at (x,y), at(y,z) -♦ at(x,z) 

or alternatively we might instead use the more abstract premises 
6 s . transitive (at) 


and 


7*. transitive (u) ~- r - (u(x,y), u(ya,s) ^ u(x,s) ) 


from which 6. can bo deduced. 


2. There ere two rules* concerning the feasibility of walking 
and driving. 

8. walkable(x), at(y,x), at(z,x), at(X,y) *■* can(go{y,s, walking) ) 

9 . firivable (x), at(y,x), at(car,y) at(X,car) can(go(y,z, driving) 


) 


There are also two specific facts 

10. walkable (home). 

11. firivable (county) 

3, Went wa have a rule concerned with the properties of going, 

12. did(go(x,y,s) ) -► at(I,y) 

4 , The problem Itself is posed by the premise; 

±3, want(at(I,airport) ) 
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5. Tho above are Gil the premises concerned 
problem. The last group of xiromises are ccsr 
problems of this sort. They are: 


- ith t-ha particular 
a to almost all 


14, (x can(y) ), (tlicl(y) -> s) “*> canachi.lt(x,y,z ) 

The predicate "can&chult (;:,y,z ) a means that in a situation to 
which 2 : appI5.cs, the action y can he performed and brings about 
a situation to which s applies. A sort of transitivity is de¬ 
scribed by 

15 * canachult(3i,y, z ), ecneohnlt(z, u, v) camchult(x,pro 3 (y,tt),v). 

Here prog(tt,v) is the program of first carrying out v. and then v. 

('Some hie 1 of identification of a Girdle action u with the one 
sr-3D r-ro,grata prcg(u) is obviously required, but the details of 
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ho*.-; this t;ill fit into the formalism have not yet been worked out) 

The final premise is the one which causes action to be taken. 

±6. x,canachult (x,prog (y,3 ) ,t?) * r?ant(w) -* do(y) • 

The argument the “advice taker” must produce in order to 
solve the problem deduces the following propositions in cor© or 
less the following order: 


lc 


2 . 

3. 

4. 

5. 

6 . 

v 

7, 


at (I,desk) can (go (desk, car, walking) ) 

at (I,car) •* c cue (go (heme, airport, driving 
did (go (don--.car,walking) ) ah(I,car) 
did (go (he"-: e , airport, driving) ) at (I, airport) 

eanachult (at (I,deak), go (desk, car,walking), at (I, car) ) 
canachnlf(at(I,car), go(home,airport,driving), at(I,airport) 

car, walking), go (home. 


> 



8, do(go(desk,car,walking) ) 


The deduction of the last proposition initiates action 


The above proposed reasoning raises two major questions of 
heuristic. The first is that of how the ±6 premises are collect¬ 
ed, and the second is that of how the deduction proceeds once 
they are found. He cannot give complete answers to either ques¬ 
tion in the present paper; they are obviously not completely 
separate since some of the deductions might be made before 
sore of the premises are collected. Let us first consider the 
question of where the 16 premises come from. 
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First of all, we assert that except for the 13 th premise 
(want{at(X,airport) ) which sets the goal) and the 1 st premise 
(at (X,desk) which we shall get from a routine which answers the 
question "where am X”), all the premises can reasonably he ex¬ 
pected to be specifically present in the memory of a machine 
which has competence of human order in finding its way around. 
That is, none of them are so specific to the problem at hand 
that assuming their presence in memory constitutes an anticipa¬ 
tion of this particular problem or of a class of problems nar¬ 
rower than those which any human can expect to have previously 
solved, he must impose this requirement if we arc to be able to 
say that the "advice taker" exhibita common sense. 

On the other hand, while we may reasonably assume that the 
premises are in memory, we still have to describe how they are 
assembled into a list by themselves to which the deduction 
routine.may be applied. Tentatively, we expect the "advice 
taker" to proceed as follows: initially, the sentence "want£afc(X 
airport) )" is on a certain list L, called the main list, all 
by itself. The program begins with an observation routine which 
looks at the main list and puts certain statements .about the con¬ 
tents of this list on a list called "observations of the main 
list". We shall not specify at present what all the possible 
outputs of this observation routine are but merely say that in 
this case it will observe that "the only statement on L has 
the form # want(u(x) }U” (he write this cut in English because 
we have not yet settled on formalism for representing state¬ 
ments of this kind.) The "deduce and obey" routine is then ap¬ 
plied to the combination of the "observations of 'the main list” 


list, and a list called the "standing orders lisi 


If * 


This list 


is rather small and is never changed, or at least : 1 s only changed 
in major changes of the advice taker1 The contents of the "stand¬ 
ing orders" list has not been worked out, but what must he fie- 


fcempts to copy the statements on its property list. Let us as-. 
sums that it fails in this attempt because "want(at(X,airport) )" 
does not have the status of an object and hence has no property 
list. (One might expect that if the problem of going to the 
airport had arisen before, "want(at(X,airport) )"would be air 
object, but. this might depend on whether there were routines 
fo.r generalising previous experience that would allow something 
of general use to be filed under that heading.) 1-Text in order 
of increasing generality the machine would see if anything'were 
filed under "want (at (l,x) )" which would deal with the general 
problem of getting somewhere. One would expect 'that premises 6, , 
(or 6* and 71), 8 , 9 , 12, would be so filed. There would also ' 
be the formula 


want(at(l,x) ) do(observe(where am X) ) 






-9- 


v 




which would give us praraisa 1. Thera would also be a reference 
to the naxt higfcar love! of aba traction in the goal stator^nt 
wnich would cause a look at the rronerty list of “want (x )", 

Tills would give us 14, ± 5 , and 1(5. 


We shall not try to follow the solution further except to 


)” there would be a rule that starts 



, and want{l,x 5 u and has as conclusion 
a searcn for the property list of "go(y,x,s)". This would pre- 
•suraaoly fall, and then there would have to be heuristics that 
would initiate a search for a y such that "&t(l y y) a and H at( 

® ^ ). • This would fee done by looking on the property 
lists of tne origin and the destination and working* up". Then 
promise 9 would be found which has as one of its premises 
at(I,car). A repetition of the above would find premise 8 , 
which would complete the sot of promises since the other "at" 
premises would have been found as by-products of previous searches 


We hops that the presence of the heuristic rules mentioned 
on the - property lists where we have put them will seem plausible 
so the reader. It should be noticed that on the higher level of 
ac>3erection inany of the statements are of the stimulus-response 
form. One might conjecture that division in man between con¬ 
scious and unconscious thougut occurs at the boundary between 
stimulus-response heuristics which do not have to be reasoned 
about but only obeysu, and the others which have to serve as 
premises in deductions. 


. We hope to formalise the hour! 
we start programming the system. 
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